Sum-of-Squares Based Cluster Validity Index and Significance Analysis

نویسندگان

  • Qinpei Zhao
  • Mantao Xu
  • Pasi Fränti
چکیده

Different clustering algorithms achieve different results to certain data sets because most clustering algorithms are sensitive to the input parameters and the structure of data sets. Cluster validity, as the way of evaluating the result of the clustering algorithms, is one of the problems in cluster analysis. In this paper, we build up a framework for cluster validity process, meanwhile a sum-of-squares based index for cluster validity purpose is proposed. For homogeneous data based on independent variables, the proposed clustering validity index is effective in comparison to some other commonly used indexes. We use resampling method in the framework to analyze the stability of clustering algorithm, and the certainty of cluster validity index also.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

WB-index: A sum-of-squares based index for cluster validity

Article history: Received 5 September 2012 Received in revised form 2 May 2014 Accepted 11 July 2014 Available online 17 July 2014 Determining the number of clusters is an important part of cluster validity that has been widely studied in cluster analysis. Sum-of-squares based indices show promising properties in terms of determining the number of clusters. However, knee point detection is ofte...

متن کامل

مدل ساختاری فرایند توانمند‌سازی منابع انسانی دانش‌بنیان

One of the main concerns of managers today is the empowerment of human resources and their application to knowledge-based companies. The ultimate goal of this research, based on the strategy of exploratory mix methods, was to create and test the grounded theory about the process. A model that was based on theoretical studies and analysis of deep interviews with knowledge base experts was design...

متن کامل

Nonparametric Least Squares Regression and Testing in E Conomic Models

This paper proposes a tractable and consistent estimator of the (possibly multi-equation) nonparametric regression model. The estimator is based on least squares over sets of functions bounded in Sobolev norm and is closely related to penalized least squares. We establish consistency and rate of convergence results as well as asymptotic normality of the (suitably standardized) sum of squared re...

متن کامل

Validation and Localization of the Persian Version of Short form the Index of Ability and Readiness of Performing the Mission in Military Nurses

Background and Aim: Nursing is an important subset of the health care system to act in critical situations. Military and civilian nurses are among the first to appear on the scene and provide services in the event of an accident or disaster, and military nurses play a double role in times of crisis due to their special security dimension. Assessing the capability and readiness of military nurse...

متن کامل

Analysis of genotype × environment interaction for seed yield of promising Kabuli type chickpea (Cicer arietinum L.) promising lines

One of the most complicated issues in plant breeding programs is genotype by environment interaction. To evaluate seed yield stability of 18 chickpea promising lines, a field experiment was conducted using randomized complete block design with four replications in two cropping seasons (2018-19 and 2019-2020) in four research stations (Maragheh, Kurdistan West Azerbaijan and Hamedan), Iran. Comb...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009